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ABSTRACT 

We measure the quasar two-point correlation function over the redshift range 2.2 < z < 2.8 
using data from the Baryon Oscillation Spectroscopic Survey. We use a homogeneous sub- 
set of the data consisting of 27,129 quasars with spectroscopic redshifts — ^by far the largest 
such sample used for clustering measurements at these redshifts to date. The sample cov- 
ers 3,600 deg^, corresponding to a comoving volume of 9.7 (h^^Gpc)'^ assuming a fiducial 
ACDM cosmology, and it has a median absolute i-band magnitude of —26, /s-corrected to 
z = 2. After accounting for redshift errors we find that the redshift space correlation func- 
tion is fit well by a power-law of slope —2 and amplitude sq = (9.7 ± 0.5) h^^Mpc over 
the range 3 < s < 25 h^^Mpc. The projected correlation function, which integrates out the 
effects of peculiar velocities and redshift errors, is fit well by a power-law of slope —1 and 
ro ~ (8.4 ± 0.6) h^^Mpc over the range 4 < i? < 16/i^^Mpc. There is no evidence for 
strong luminosity or redshift dependence to the clustering amplitude, in part because of the 
limited dynamic range in our sample. Our results are consistent with, but more precise than, 
previous measurements at similar redshifts. Our measurement of the quasar clustering ampli- 
tude implies a bias factor of 6 ~ 3.5 for our quasar sample. We compare the data to models to 
constrain the manner in which quasars occupy dark matter halos at z ^ 2.4 and infer that such 
quasars inhabit halos with a characteristic mass of (M) ~ 10^^ H^^Mq with a duty cycle for 
the quasar activity of 1 per cent. 

1 INTRODUCTION 

Quasars are among the most luminous astrophysical objects, and 
are believed to be powered by accretion onto supermassive black 
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holes (e.g. lSalpeteilll964lLvnden-Belllll969l) . They have become 
a key element in our cun'ent paradigm of galaxy evolution - es- 
sentia lly all spheroidal systems at present harbor massive black 
holes JKormendv & Richstondl 19950 . the masses of which are cor- 
related with many properties of their host systems. The emerg- 
ing picture is that quasar activity and star formation are iii ex- 
tricably linked (e.g. iNandra et all (20071; [Silverman et al.ll2008[) in 
galaxies that contain a massive bulge (and thus a massive black 
hole) and a gas reservoir. The galaxy initially forms in a gas-rich, 
rotation-dominated system. Once the dark matter halo g rows to a 
critic a l scale some event — ^most likely a major merger ( Carlberd 
1990'; 'Haiman & Loeb 1998; Cattaneo, Haehnelt & Re ed 1 19991; 



Kauffmann & Haehnelt. 200(1 ; .Springel et al..2005.; .Hopkins et al 
20061) or instability in a cold-str eam fed disk (ICiotti & Ostrikei 



1997ll200ll ; lDi Matteo et al.ll20 12')— triggers a period of rapid, ob- 



scured star formation and the generation of a stellar bulge. After 
some time the quasar becomes visible, and soon after the star for- 
mation is quenched on a short timescale, perhaps via radiative or 
mechanical feedback from the central engi ne (e.g. Ishankanl2009l ; 
lNataraianll2012l ; [Alexander & Hickoxll2012l) . 

The clustering of quasars as a function of redshift and 
luminosity provides useful constraints on our understanding 
of galaxy evolution. The large-scale clustering amplitude in- 
creases with the mass of the dark matter halos hosting 
the quasars. Comparison of the abundance of such halos to 
that of quasars can provide constraints on the duty cycle 
and degree of scat t er in the observable hal o -mass relation 
JCole & R aise? '1989'; 'Martini & Weinberg" '200l'; 'Haiman & Hui' 
2001 1 ; Iwhite. Martini & Cohn 2008; Shankar, Weinberg & Shen 
20101) . However quasars are extremely rare, so very large 
surveys are necessary to suppress the shot-noise from Pois- 
son fluctuations. Samples of quasars have only recently in- 
cluded enough objects to study their clustering with some preci- 
sion (Porciani, Magliocchetti & Norberg 2004; Croom et al. 2005; 
Porciani & No rberel l2006t iHeiinawi et al.l l2006al; iMversetall 
2006. .2007a.bi; Ida Angela et alj 20081 ; IPadmanabhan et alj I2609I ; 
Ross et alj2009l ; IShen et al.ll2009l)~ 



Naively, measuring the clustering of quasars between redshift 
2 and 3 should be a simple task, as this is where the comoving 
number density of luminous quasars seemingly peaks dWeedmar 



number density or turmnous quasars seemingly peaJcs GWeeamaE 
19861 ; iHartwick & Schaddl 19901 ; ICroom et al.l2005l; [Richards et a~ 



2006h . However, selection effects complicate quasar targeting in 
this range. The colours of normal (unobscured) quasars around 
z ^ 2.1 resemble far more abundant st ellar populations, particu- 
larly metal-poor A and F halo stars (e.g.. lFanlll999t [Richards et al.l 



I2OOII) . This issue is enhanced at the faint limits of imaging surveys 
that achieve sim ilar depth to the Sloan Digital Sky Survey (SDSS; 
lYork et al.l2000 1. where those compact galaxies that are dominated 
by A and F stellar populations contaminate selection at the 10- 
20 per cent level as s tar-galaxy separa tion becomes difficult (e.g., 
[Richards et alj|2009l ; IBovv et al.ll2012f) . In addition, faint quasars 
with z ~ 2.2-2.6 can have similar colours to quasars at 2 ~ 0.5, 
which are contamina t ed by redder light from their host galaxy (e.g ., 
iBudavari et ai]|200ll ; [Richards et ^1200 ll ; IWeinstein et"d]|2004i) . 
In combination, the cuts that must be made to efficiently select 
quasars in this redshift range mean that optical surveys of quasars 
may miss a significant number of quasars. A wide-area survey at 
high targeting density is thus an attractive proposition for the study 
of quasar clustering at moderate redshift. The data we consider in 
this paper are drawn from just s uch a survey; the Bary on Oscillation 
Spectroscopic Survey (BOSS; Eisenstein et al.ll201 11) . 

Physical effects also conspire to make quasars difficult to 



sample at z ~ 2.5. Obviously, quasars simply appear fainter at 
greater distances, but in addition quasars seem to exhibit cosmic 
downsizing, with the population of less luminous quasars peak- 
ing at lower redshift JCroom et al.l 120091) . Thus, the most lumi- 
nous quasars are both more abundant and the most visible members 
of the quasar population at 2 ~ 2.5. Quasar cl ustering measure- 
ments at high redshift from the original SDSS l lShen et al.ll2007l) 
therefore only sample the most luminous quasars, implying that 
deeper spectroscopy th an used for the original SDSS quasar survey 
( JRichards et al.ll2002al) is necessary for sampling quasars across a 
large dynamic range in luminosity near z ~ 2.5. Indeed, the final 
BOSS quasar sample should be ~ 10 x larger, and almost 2 magni- 
tudes deeper, than the original S DSS spectroscopic qua sar sample 
at 2.2 < 2 < 3.5 (14,065 objects; ISchneider et al.ll2010l) . 

The outline of the paper is as follows. In !j2] we describe 
the quasar samples that we use — drawn from the SDSS and the 
BOSS. The clustering measurements are described in ij3| includ- 
ing comparisons with earlier work. The implications of our results 
for quasars are explored in ij4] We conclude in ij5] Appendix IaI 
discusses the impact of redshift errors on our measurements while 
Appendix |B] contains the technical details of the model fits used 
in this paper. Where necessary we shall adopt a ACDM cosmo- 
logical model wit h rjmat = 0.274 , Aa = 0.726 and a s = 
0.8 as assumed i n lWhite et alj J201ll) . JAnderson etal] ( 1201 2|) and 
JReid et al.l J2012h . Unless the h dependence is explicitly specified 
or parametrized, we assume h = 0.7. Dark matter halo masses are 
quoted as A/igo;,, i.e. the mass interior to a radius within which 
the mean density is 180 x the background density of the Universe. 
Luminosities will be quoted in Watts, and magnitudes in the AB 
system. 



2 DATA 

The Sloan Digital Sky Survey (SDSS; JYork et a l.' 20001 mapped 
nearly a quarter of the sky using the dedicated Sloan Foundation 
2.5-meter telescope ( JGunn et al.l2006l) located at Apache Point Ob- 
servatory in New Mexico. A drift-scanning mosaic CCD camera 



( Gunn et ah 19981) imaged the sky in five photometric b andpasses 
JFukugita etal.ll 19961 ; ISmith et alj|2002l ; iDoi et alJboiO ") to a lim- 
iting magnitude of r ~ 22.5. The imaging data were processed 
through a series of pipelines that perfor m astrometric calibr ation 
( jPier et al.ll2003l) . photometric reduction fLupto n et al.iuOOll) . and 
photometric calibration (jPadmanabhan et al. 2008|). We use quasars 
se lected from this 5 -band SDSS photomet ry as described in detail 
in lBovv et al] ( I2OIII) and .Ross et al] ( |2012|) . 

Selecting quasars in the redshift range 2 ~ 2-3, where the 
space density of the brightest quasars peaks ( JRichards et al J! 20061 ; 
[Croom et al. 2009), is made difficult by the large populations of 
metal-poor A and F stars, faint lower redshift quasars and compact 
galaxies which have similar col ours to the objects of interest (e.g., 
lFanlll999l ; JRichards et alj|200in . In our case the problem is com- 
pounded by the fact that we wish to work close to the detection 
limits of the SDSS photometry, where errors on flux measurements 
cause objects to scatter substantially in colour space, and that the 
BOSS key science programs did not include the study of the clus- 
tering of 2 ~ 2.5 quasars. 



2.1 Clustering subsamples and the angular mask 

The quasar component of BOSS is designed primarily as a Lyman- 
a Forest survey, which does not require quasars to be selected 



© 0000 RAS, MNRAS 000, 000-000 



Quasar clustering 3 



60° 

50° 

40° 

•I 30° 
c 

1 20° 
Q 

10° 

0° 

-10° 



rn:. 









^plll|1M.Mri'«B~l 



6h 9h 12h 15h 18h 21h Oh 3h 

Right Ascension 



Figure 1. The angular distribution of our quasar sample, in J2000 equatorial 
coordinates and Aitoff projection. We have rotated the reference line by 90° 
so that the North and South Galactic survey regions appear contiguous in 
the left and right parts of the plot, respectively. Areas which we use in our 
analysis (light grey), have completeness to XDQSO targets of greater than 
75 per cent. Other areas (dark grey) are mainly early survey regions where 
XDQSO was not used as the CORE targeting algorithm. The black areas 
depict imaging data in which the u-band chip was not operating, which are 
discarded from our analysis. 




in a uniform — or even a recreatable — manner across the sky. For 
the study of quasar clustering however, uniform selection is key. 
To satisfy these competing scientific requirements, the survey thus 
adopted a CORE+BONUS strategy, where the CORE objects cor- 
respond to a uniform sample selected by the extreme deconvolution 
(XDlj algorithm. XD is applied in BOSS to model the distributions 
of quasars and stars in flux space, and henc e to separate quas ar tar- 
gets from stellar contaminants (XDOSO: IBovv et al.l 120111) . It is 
this CORE sample that we analyze in this paper. 

Specifically, we take as quasar targets all point sources in 
SDSS imaging that have an XDQSO probability above a threshold 
of 0.424 to the magnitude limit of BOSS q uasar target selection 
{g < 22.0 or r < 21.85; IRoss et al.ll2012l) . By quasar targets in 
this sense, we mean all quasars that would have been observed in a 
perfect survey. In reality, not all such targets are observed because 
not all fibers can be placed on an object during normal survey oper- 
ations. More importantly, the XDQSO algorithm was not adopted 
as the final CORE algorithm for BOSS quasar target selection until 
the second year of operations (Ross et al. 2012) . We will work with 
spectroscopy take on or before January 1, 2012 — just over the first 
two years of BOSS data. So, on average, more potential targets with 
an XDQSO probability greater than 0.424 are unobserved (spectro- 
scopically) in areas that were covered in the first year of BOS S (see 

Fig.B- 

We use the MANGLE software JSwanson et al.ll2008h to track 
the angular completeness of the survey (the mask). The complete- 
ness on the sky is determined from the fraction of quasar targets in 
a sector for which we obtained a spectrum; a sector is an area of the 
sky covered by a unique set of spectroscopic tiles (see lBlanton et al.l 
I2OO3I ; iTegmark et al.ll2004h . For our analyses, we limit the survey 
to areas with targeting completeness greater than 75 per cent. By 
targeting completeness, we mean the ratio of the number of quasar 
targets that received a BOSS fiber to all quasar targets. 



^ XD IBovv et alj200g ) is a method to describe the underlying distribution 
function of a series of points in parameter space (e.g., quasars in colour 
space) by modehng that distribution as a sum of Gaussians convolved with 
measurement en'ors. 



Figure 2. The absolute magnitude distribution and number of quasars 
vs. redshift for our sample. (Upper) The lO**", 25*'', 50"^, TS"" and 90*'^ 
percentiles of Mi vs. redshift (see text). (Middle) The same percentiles now 
in Mi — Mi,^i vs. redshift. (Lower) the (normalized) redshift distribution of 
quasars. The vertical dotted lines indicate the redshift ranges we consider in 
our study. 



We do not correct for spectroscopic incompleteness — i.e. ac- 
count for the fraction of observed targets which produce a spectrum 
of sufficient quality to measure a redshift. Quasars at 2: > 2.2 are 
identifiable in BOSS even at very low signal-to-noise ratio because 
the strong LyaA1215 line alwa ys falls within the BO SS wave- 
length coverage (3.600-10.000A; lEisenstein et alj201 i V In BOSS, 
almost all unidentifiable objects are likely to be stars, galaxies or 
low redshift quasars, not the z > 2.2 quasars of interest in this 
paper. Correcting for spectroscopic incompleteness would induce 
a false large scale clustering signal because the density of stellar 
contaminants varies over the sky. 

We apply a veto mask to remove survey regions in which a 
quasar could never be observed — areas near bright s tars and the 
centerposts of the spectroscopic plates (as described in lWhite et al.l 
,2011). We also remove fields where the conditions were not 
deemed photome tric by the SDSS imaging pipeline (again see 
IWhite et al.l201in . Finally, we re move areas that have b ad u data in 
the SDSS imaging scans (§3.3 of lAbazaiian et al.l2004l see Fig.[Tll. 
The resulting XDQSO CORE targets were matched to the list of 
objects for which the BOSS successfully obtained a spectrum, and 
throughout this paper we only consider regions where at least 75 
per cent of the XDQSO CORE targets received a BOSS fiber for 
spectroscopic observation. 

We study BOSS data taken on or before January 1, 2012. 
This limits our analysis to version 5_5_0 of the spectral reduction 
pipeline (spAll-v5_5_0) and to the areas plotted in Fig.[T] Note 
that these are slightly later reductions than made publicly available 
with SDSS Data Release 9 (DR9; which uses v5_4_45). Algorith- 
mically v5_5_0 is the same as v5_4_45, but changes for calibrations 
of a newly installed CCD affects data past DR9. 
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2.2 Redshift assignation 

The BOSS wavelength coverage is 3,600-10,000A so for quasars 
above z ~ 1 the [O III] AA4958,5007 complex is shifted out of 
the BOSS window. Systemic redshifts for most BOSS quasars thus 
rely solely on information from broad emission lines in the rest- 
frame ultraviolet. The next lowest ionization line typically found 
in quasar spectra Mg II A2798, which is a good redshift indicator 
JRichards et al.ll2002bf) . is shifted entirely beyond the BOSS spec- 
tral coverage near z ~ 2.5. To measure quasar redshifts above 
z ~ 2.5 we rely on combinations of the CIII A1908, CIVA1549, 
and LyQ:A1215 lines. In addition to being broad, the centroid of 
each line may be biased — C III] is often blended with Si III], Al III 
and Felll complexes, CIV can be shifted from the systemic red- 
shift by strong outflows, and Ly a is ofte n affected by Ly a Forest 
absorption and is blended w ith NV (e.g.. lVanden Berk et al.ll200ll : 
[Richards et al.l2002bl . l2oTIh . 

To ameliorate these issues, when the BOSS pipelinqj identi- 
fies a quasar spectrum as having an Mg II, C III] or C IV line that 
is within the BOSS wavelength coverage, we de fault to the red- 
shift f rom that line offset using the prescription of iHewett & Wild 
ilOl& i. For spectra with no such lines, we adopt the BOSS pipeline 
redshift. The BOSS collaboration is visually inspecting all quasar 
spectra to check the pipeline redshifts. When a pipeline redshift 
conflicts with the visual redshift (438 objects) we adopt the human- 
corrected redshift. As we mainly analyze clustering on scales that 
correspond to velocities that are larger than typical quasar broad 
lines, altering redshifts by a small amount does not strongly affect 
our results. Appendix IaI discusses redshift errors further. 



2.3 Quasar luminosities and k-corrections 

Fig- m plots the conditional magnitude distribution of our sample, 
compared to the characteristic luminosity of quasars at that redshift. 
W e correct all magnitud es to z = 2 using the fc-corrections derived 
bv ll?ichards et al . I ( 120061) . The characteristi c luminosity — where the 
luminosity function changes slope — from ICroom et al.l ( 120041) . as 
modified bv lCrotonI ( |2009|) . is 

Mi,*(z) = -21.61 -2.5 (fciz + fez^) -0.71 , (1) 

where fci = 1.39 and fe = —0.29 for 2 < 3 and fci — 1.22 and 
k2 = —0.23 for z > 3 . We have converted from the bj band used 
bv lCroom et alj ( 120041) to the i -band (fc-corrected to z = 2) using 
Mi{z = 2) = Mt, - 0.71 dRichards et ai]|2006l) . Note that in 
the range 2.2 < 2 < 2.8, which will be the focus of this paper, 
we are able to probe 1-2 magnitudes fainter than the characteristic 
magnitude. 

Using SDSS broad-band colours to derive fc-corrections for 
our sample is problematic. Most BOSS quasars at redshift 2 ~ 
3 are near the flux limit of SDSS imaging, and so they have 
noisier colo urs than for previous SDSS prescriptions at brighter 
limits (e.g. [Richards et al.l l2006f) . Deriving full fc-corrections at 
2 ~ 3 as a function of flux, colour and redshift is beyond the 
scope of this paper (but see McGreer et al. 2012, in prepara- 
tion). Precise fc-corrections will require proper modeling of subtle 
changes due to, e.g., the Baldwin Effect, the presence of complex 
iron emission crossing through the z-band, and the movement of 
broad lines — which can be offset due to luminosity-and-redshift- 
dependent winds and absorption features — across the SDSS filter 



The BOSS pipeline is described in lAihara et al.l <201 11) 



Sample 


Name 


Redshift 


Magnitude 


JVqso 


1 


All 


2.2 < 2 < 2.8 


-50.0, -10.0] 


27,129 


2 


Bright 


2.2 < 2 < 2.8 


-50.0,-25.8] 


13,564 


3 


Dim 


2.2 < 2 < 2.8 


-25.8,-10.0] 


13,564 


4 


Fid 


2.2 < 2 < 2.8 


-27.0,-25.0] 


19,111 


5 


LoZ 


2.2 < 2 < 2.4 


-27.0,-25.0] 


8,835 


6 


HiZ 


2.4 < 2 < 2.8 


-27.0,-25.0] 


9,977 



Table 1. A summary of the quasar samples we consider. The columns list 
the sample number and name, redshift and magnitude ranges, and the num- 
ber of quasars. Magnitudes are fc-corrected to 2 = 2. 



set. For the redshifts on which we focus in this paper (2.2 < 2 < 
2.8) only the CIIIA1908 line enters the i-band. Fortuitously, this 
complex does not shift much with luminosity, which reduces any 
flux dependence to the fc-correction for our sample. 



3 CLUSTERING 

All of our clustering measurements are performed in configuration, 
rather than Fourier, space. For rare objects, where shot-noise is an 
important or dominant piece of the error budget, the configuration 
space estimators have the advantage of more nearly independent 
errors. They also deal well with irregularly-sampled geometries, 
such as we have for our sample. We shall comp ute both the real- 
and re dshift-space correlation functions, using the lLandv & Szalavl 
([19931) estimator, with a density of random points 50 times the den- 
sity of quasars. 



3.1 The random catalog 

As discussed in ^2. 11 we use MANGLE (I Swanson et alj|2008l) to 
track the angular completeness of the survey. Angular positions 
for the random points, modulated by the angular completeness of 
the survey, were obtained from the MANGLE program ransack. 
To assign redshifts to the random catalog we tried three methods, 
which yielded almost identical results. The first was to assign to 
each point a redshift drawn at random from the data. While this 
method would produce artificial structure in the redshift distribu- 
tion of the random points for a small survey, due to sample variance, 
the wide angular coverage of the BOSS survey ensures this method 
performs well. We also tried fitting splines to the histogram of the 
quasar redshifts and the cumulative histogram of the redshifts and 
using those splines to generate random redshifts. The results were 
insensitive to using the histogram or cumulative histogram, to the 
number of spline points and to the type of spline used. For the re- 
sults presented below we use the first method. 

3.2 Fiber collisions and small-scale clustering 

We cannot obtain spectroscopic data for a few percent of quasars 
due to fiber collisions — no two BOSS fibers can be placed closer 
than 62" on a specific plate. At 2 ~ 2.5 the 62" exclusion cor- 
responds to 1.26/i~^Mpc (comoving). Where possible we obtain 
redshifts for the collided quasars in regions where plates overlap. 
We account for the remaining exclusions by restricting our anal- 
yses to relatively large scales and by up-weighting quasar-quasar 
pairs with separations smaller than 62" . The upweighting is derived 
by comparing the angular correlation function of all targets with 
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Figure 3. The projected correlation function split by hemisphere (or Galac- 
tic latitude), compai'ed to the fiducial sample. The dashed line corresponds 
to the projected correlation function for a real-space correlation function 
with ro = 8 h~^Mpc and a power-law slope of —2 to guide the eye. Note 
the weakly significant excess power at large scales for the south-only sam- 
ple (see text). 



that of the quasars for which we obtained redshifts ( Hawkins et alj 
I2OO3I: iLi et alJlioO^ : IRoss et alj|2007l : IWhite et alj|201lh . This ra- 
tio is close to unity above 62" but drops to about two-thirds below 
62". The number of pairs for which this correction is appreciable 
is quite small, and the impact of this correction is much less than 
la even on the smallest scales. If fiber-collided quasars preferen- 
tially live in regions of higher-than-average density the large-scale 
clustering would be affected by fiber collisions and would not be 
properly corrected by our weighting procedure. The efficiency of 
the tiling algorithm, the prioritization of quasar targets over galax- 
ies and the depth of the survey combine t o make fiber coll isions a 
very small effect on our analysis (see also lRoss et al.ll2009f) . 



3.3 Tests of systematlcs 

We have performed numerous jackknife tests to check whether our 
results are robust to possible systematics. Specifically we have in- 
vestigated whether our results are stable to cuts on targeting, at 
what point in the survey the plates were drilled and what target- 
ing algorithm was used, sector completeness, Galactic latitude and 
hemisphere (Fig. |3}, extinction in the g-hand, target areal density, 
stellar density, raw i-band magnitude (which is a proxy for signal- 
to-noise ratio), sky brightness, g-hand seeing (Fig.|4} and selection 
threshold. In all cases but one we see no evidence for a statistically 
significant systematic effect. The exception is that there is weak ev- 
idence that the large-scale clustering of quasars in the South Galac- 
tic Cap is stronger than that in the North Galactic Cap. It will re- 
quire more data to determine whether this is a statistical fluctuation 
or a significant difference — and, of course, we conducted eleven, 
not one, different tests of systematics. When eleven (independent) 
trials are performed the likelihood of a 2(t detection is quite high 
(~ 40 per cent instead of ~ 5 per cent for a single trial). In fu- 
ture a quasar catalog with good photometric redshifts could help 
with some of these issues. In addition BOSS will continue to obtain 
quasar data until 2014, so we defer a more detailed investigation of 
geographical discrepancies to a future publication. 



Figure 4. The projected correlation function split by whether the median 
seeing in g band in a sector is better than 1.25" (SB125) or worse than 
1.25" (SW125) compared to the fiducial sample. The dashed line corre- 
sponds to the projected correlation function for a real-space correlation 
function with ro = 8 h^^Mpc and a power-law slope of —2 to guide the 
eye. There is no statistically significant difference between the two halves 
of the data. This is typical of the other jackknife tests we have performed. 



3.4 Clustering results 

We have insufficient sensitivity to measure the angular depen- 
dence of the redshift-space clustering induced by redshift space 
distortions for our highly biased quasars. Therefore, we only quote 
redshift-space results from the angle-averaged correlation function, 
which we denote ^(s) at redshift space separation s. Real-space 
clustering is constrained by the projected correlation function 



p{R)= / dZ^{R,Z) 



(2) 



avoiding the need to model redshift space distortions and mitigat- 
ing any effects of redshift eiTors. We truncate the integral over 
the line-of-sight separation, Z, to ±50/i~^Mpc. This value rep- 
resents a trade-off between the goal of fully integrating out the ef- 
fects of redshift space distortions and the disadvantages of intro- 
ducing noise from only weakly correlated structures along the line- 
of-sight and mixing a wide range of 3D scales into a single R bin. 
By 50 h~^Mpc the effects of redshift space distortions are negligi- 
ble, and the truncation has only a modest effect on our largest scale 
point. However this truncation must be kept in mind when precise 
modeling of the data at the largest R is important (see below). 

We divide our quasar sample into bins of redshift over which 
the bias and mass coiTelation function are evolving strongly. For- 
tuitously, on the scales of relevance the effects approximately can- 
cel, i.e. the clustering amplitude stays approximately constant. The 
redshift-bin-averaged ^ can be approximated as a measurement of 
^ evaluated at an effective redshift, Zcft : 



ZeS = 



Jdz{dN/dzf{H/x')z 
Jdz{dN/dz)^H/x^) 



so that 



C(s,^off)^(C(s)) = 



fdz{dN/dzf(H/x^)as,z) 
Jdz{dN/dzf{H/x^) 



(3) 



(4) 
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Figure 5. The projected con'elation functions, Wp (R), for the six samples considered in this paper (Table[T]. The error bars aie the square roots of the diagonal 
elements of the covariance matrices, as determined by bootstrap resampling (see text). The dashed lines show the best fit power-laws with slope —2 (see Table 
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Table 2. The uip data for sample #1, "All", (the largest data set). The first 3 rows list the transverse separation, R, Wp and its error (all in /i~^Mpc). The 
remainder of the table presents the correlation coefficients as estimated from the covariance matrix computed using bootstrapping, as desciibed in the text. 
These values are plotted as the greyscale image in Fig.|2] 



where dN/dz is the redshift distribution of the sample, H is 
the Hubble parameter at redshift z an d y is the comoving 
angular diameter distance to redshift z JMatarrese et al.l 119971 ; 
IWhite. Martini & CohnI 120081) . Assuming passive evolution, con- 
stant halo mass, or constant bias leads to differences between 
^(zcff ) and {Q that are far smaller than our observational errors. 

Results for each of the samples in Table[T]are shown in Figs.|5] 
and|6l and the values for the full sample, with 2.2 < z < 2.8 
and no cuts on magnitude, are given in Table |2] Both the real- and 
redshift-space clustering are fit well by a model with an underly- 
ing power-law correlation function, once the effects of projection 
and redshift errors are taken into account. As it provid es a good fit 
to the data, and for c onsistency with earlier work (e.g. lMvers et al.l 
120061: IRoss eTai][20b 9: Shen 2009), we shall show lines in the fig- 
ures and provide fits in the tables assuming a power-law slope of 
r~^ for the real-space correlation function. The actual slope of the 



correlation function is poorly determined by the projected correla- 
tion function data. Power-law indices from —1 to —2.6 are viable. 
The best-fit slope is quite shallow, near —1.2. The value of Wp{R) 
at 7? ~ 9 h^^Mpc for the best-fit model is almost independent of 
the assumed slope. 

We estimate the cov ariance matrix of ou r measurements by 
bootstrap resampling (e.g. lEfron & Gondl983l) . We d ivide the sur- 
vey in to angular regions specified by HealPix pixels I IGorski et alj 
120051) with A^sido ~ 4 (i.e. approximately 15° on a side). Pixels 
which contain fewer random points than two-thirds of the mean are 
merged with higher occupancy pixels to ensure pixels have simi- 
lar weight. We then estimate both the mean and covariance matrix 
by bootstrap resampling pair counts from a random draw of pixels 
(with replacement). 

The bootstrap-determined correlation matrices for the full 
2.2 < z < 2.8 sample for Wp and ^ are shown in Figs.|7]and[8]re- 
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Figure 6. The redshift-space correlation function, ^(s), for the six samples considered in this paper The error bars are the square roots of the diagonal elements 
of the covariance matrices, as determined by bootstrap resampling (see text). The dashed lines show the best fit power-laws with slope —2 (see Table|3) while 
the dotted lines show the power-law once redshift errors are taken into account (see Appendix IaI. 
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Figure 7. The correlation matrix for the projected coiTelation function, 
■Wp{R), of quasars with 2.2 < z < 2.8 and no cuts on magnitude (i.e. sam- 
ple #1). 



Figure 8. The coiTelation matiix for the redshift-space coiTelation function 
5(s), of quasars with 2.2 < z < 2.8 and no cuts on magnitude (i.e. sample 
#1). 



spectively. Note that the matrices are diagonal-dominated as might 
be expected for shot-noise limited measurements — extending to 
larger scales we see larger correlations between the bins, most no- 
tably in Wp. As we begin to restrict the quasar sample in redshift 
or luminosity, and the number of objects becomes smaller, the co- 
variance matrices and their inverses can become increasingly noisy. 
We have several options at this point — with the most desirable be- 
ing a reduction in the number of degrees of freedom (i.e. data- 
compression) so that we can apply bootstrap resampling to obtain 
a converged covariance matrix, or error on a summary statistic (or 
statistics). The simplest approach would be to reduce the number 



of bins so that we have fewer, better constrained points. However 
this is not optimal for our purposes. We describe below a different 
approach based on the sparsity of our sample and the nature of the 
clustering. 

To begin we note that the correlation functions are fit well 



(X^ 



4.5 for 7 degrees of freedom for Wp, and x = 12 for 



9 degrees of freedom for ^) by power-laws over the range where 
our constraints are tightest (as expected if quasars are hosted by 
massive halos, see Fig. |9]l. We adopt a two parameter model for 
both ^(s) and Wp of the form ^ = (sq/s)^ and 
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Figure 9. The clustering of a sample of halos spanning an octave (i.e. factor 
of 2) in mass centered on 2 X 10^^ h~^MQ. This sample has a naiTow 
range of halo masses with a clear "characteristic" mass while also contain- 
ing enough halos to enable a precision estimate of the correlation function. 
The solid Hne shows the linear-theory, real-space correlation function for 
our fiducial cosmology while the dashed line is a power-law with slope —2. 
The points are computed from halos at z = 2.5 from two simulations of 
3000^ particles (m = 6 X 10 ^° fe~|Mo) each in a 2.75/i-iGpc box 
run using the code described i nlwhitd J2002h. T he halos are found using 
the friends-of-friends method l IDavis et al.ll 19851) with a linking length of 
0.168 times the mean inter-particle separation. In the upper panel the points 
show the mean of the angle-averaged, redshift-space correlation functions 
computed from the periodic boxes in the distant observer approximation. 
The lower panel shows the ratio of the two, which becomes constant at 
large scales. The horizontal dashed line is a fit to the asymptote between 
20 < s < 4 0fe-iMpc. Assuming i{s) = [b^ + (2/3)6/ + f/5]^un 
iKaisejfTgJTh with / = [Qrniz)]°-^'^ gives b ~ 3.9, in good agreement 
with the value inferred from the real-space clustering. The slope and am- 
plitude of the power-law piece of ^(s), and how low in s it extends, varies 
with the particular halo subsample chosen, but the scale-independence of 
the bias at large scales is generic. 



«;p(7?)_0Fr[(7-l)/2] 



R 



r[7/2] 



m 



(5) 



which corresponds to a 3D correlation of the form ^(r) — (ro/r)'' 
integrated to ±oo along the line-of-sight. For 7 = 2 the prefactor 
is n and we shall fix 7 = 2 throughout so that we have one re- 
maining degree of freedom. We fit these models to the measured 
correlations over the range 3 < s < 25/i~^Mpc and 4 < 7? < 
16/i~^Mpc. The non-linear, scale-dependent bias of dark matter 
halos makes their correlation functions close to a power-law on 
Mpc scales (see Figs.[9landllOt but at larger scales the bias becomes 
scale-independent and the coiTelation functions drop more quickly 
than the power-law extrapolation would suggest. In addition, on 
larger scales sample variance becomes increasingly important and 
the radial bins become increasingly correlated (especially for Wp). 
For t hese reasons we limit the fitting range as indicated (see also 
ICroom et al. 2005i) . With more data and a numerical model for the 
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Figure 10. The projected coiTelation function for the same sample of halos 
desciibed in Fig. |9] The dotted and dashed lines coiTespond to power-laws 
of slope —0.8, —1 and —1.2, arbitrarily normalized to the data at /? ~ 
10 h~^Mpc. We imposed an upper limit of ±50 h~^Mpc on the line-of- 
sight separation in Eq. l|2). Changing the mix of halos in the sample can 
change the amplitude and slope of Wp, but the break is a generic feature. 



covariance matrix we could extend the range of the measurement 

and tighten the constraints on quasar models. 

For a power-law correlation function of fixed slope each point 

provides an estimate of the correlation length, so, since Sq = 

s]^{si). The number of pairs in a bin of fixed A log s is A^pair oc 

(1 + ^)s^ and if shot-noise dominates the bins are independent and 

1/2 

the fractional error on 1 + ^(s) in each bin scales as Af j^' . Thus, 

assuming logarithmic bins and that shot-noise dominates we can 

average the estimates with inverse variance weights so that 



1/7 



So = 



^s7^(s,)™« 



E 



Wi 



where the weights are 



1 + 



27-3 



(6) 



(7) 



The values of Wi depend on an estimate, s'q, for so, and reduce to 



Wi oc 



s^+s'o' 



for 7 = 2 



(8) 



Most of the weight in the fit is produced by Si ~ sq, with the weight 
scaling as s^ ' for small s and s"^^^"' for large s. Since the bins are 
assumed to be logarithmically spaced, for 7 ~ 2 this suppression 
is quite rapid in both directions, reflecting the paucity of pairs at 
small s and the weakness of the correlation signal at large s. If the 
correlation function continued as a power-law to large s we could 
tighten our constraints by extending the fitting range, but it is at 
these larger scales where deviations from a power-law behavior are 
most expected, and where the correlations between measurements 
in adjacent radial bins act to weaken the constraints. 

Unfortunately, it is difficult to find a precise redshift for 
quasars in the range 2 < 2 < 3 from optical spectroscopy, so our 
correlation function is smeared by redshift errors which reduce the 
small-scale clustering signal (the dotted lines in Fig. |6j Appendix 
IaI discusses the impact of redshift uncertainties further). We in- 
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elude redshift errors in our model through a multiplicative factor, 
5(s) — > -F(s)^(s), derived in Appendix lAl The best fit sq can be 
derived from a generalization of Eq. (|6) which replaces the weights 
with Wi — )■ F^{si)wi, multiplies the ^(si) in the weights by F{si) 
and divides each term in the numerator of Eq. (|6) by F{si). 

For a given estimate, Sq, the optimal estimate of so can be 
written either as a weighted sum of the ^^ bins or directly in terms of 
the pair counts themselves. One can iterate this estimator, in which 
case the error properties of so become more complex and are best 
handled by a bootstrap procedure. We generate so for each boot- 
strap sample, with the iterative weighting scheme above starting 
from So = 0, and use the standard deviation for our estimate of un- 
certainty. This uncertainty estimate does not include the additional 
contribution from our uncertainty in the redshift error correction. 
A change in redshift error (a^) of ±25 per cent moves the best-fit 
So by 1(7, which can be considered an additional systematic error 
on the fit. While a 25 per cent uncertainty on the redshift error is 
consistent with AppendixlAl we find that reasonable fits to ^(s) can 
be obtained for a wide range of redshift error due to a degeneracy 
between the assumed redshift error and the amplitude (and slope) 
of the underlying correlation function. Because of this additional 
uncertainty, our strongest cosmological constraints will come from 
the projected correlation function, to which we now turn. 

A similar estimator can be used for the real-space correla- 
tion length, ro, under the same assumptions. The weights in this 
case become w^^ oc (2^max + 'Wpi)R-''~ if the integration in 
Eq. |2] extends from — ^max to Zmax. For 7 ~ 2 the weights are 
nearly constant for all the samples we consider for all R of inter- 
est. As for the case of the redshift errors and ^(s), we can account 
for the effects of finite Zmax on Eq. [5]by modifying the weights 
to Wi — >■ F'^(Ri)wi and dividing each term in the numerator by 
F{Ri), whertO i^(i?) = {2/-k) arctan(Zmax/i?) for 7 = 2. 

The clustering strength derived from this procedure is a sta- 
tistically valid summary of the data under the assumption that a 
power-law provides a good fit to Wp . However it does not need to 
be an optimal compression of the available information — if the 
data are sufficiently informative a better constraint on the cluster- 
ing amplitude could be obtained by fitting all of the data. For the 
full sample (#1 in TablefTJ, where we have a reasonably converged 
estimate for the covariance matrix, we can compare the different 
methods. In this case, the likelihood derived from the ro determined 
as above is very similar to that derived from the full covariance 
matrix (or the diagonal elements) indicating that in our case our 
estimate of ro does provide an almost exhaustive summary of the 
constraints available (Fig. lilt. This will be even more the case for 
samples with more shot-noise. Our simple estimator is the preferred 
approach for quoting clustering measurements and errors on sparse 
samples where estimating a full covariance matrix is not feasible. 

In addition to our estimates of ro an d so we also pro- 
vide another summary statistic (motiv ated by ICroom et al.l 120051 : 
Ida Angela et al.ll2008l : lRoss et alj2009l) . 



with Smin — 5h ^Mpc and Sn 
(so/s)^ Eq. l|9j becomes 



^ 



3sg 



\ Sniax / 



1 



+ ■ 



= 20/i"^Mpc. For C(s) = 



(10) 



(11) 



where the last step assumes Smin ^ Smax. We adopt a lower 
limit, Smin 7^ 0, to mitigate the effect of redshift errors and scale- 
dependent bias — Eq.llllshows that this differs from the Smin case 
by 25 per cent for a power-law of index — 2. With our lo wer limit 
the bias inferred from modeling f using the lKaisen (1987J) prescrip- 
tion 



C(s) 



2 ^ 26/ ^r ,, 



(12) 



agrees to 1 per cent with that inferred from the real-space clustering 
for the simulation results described in Figs.|9]and[To] If ^un is used 
in place of ^roai in Eq. [12] the error is also 1 per cent for the case 
shown in Figs.|9]and[T0] though it becomes larger for more biased 
samples. We compute ^ from the data by assuming ^(s) can be 
modeled as a constant within the 10 bins, spaced equally in log 
between Smin and Smax- The values are corrected upwards by 7 
per cent to account for the effect of redshift errors. To allow easy 
comparison with earlier work we use the value of f to estimate the 
bias in Table[3] 

Our results are listed in Table |3] and compared to previous 
work in Figure [12] We do not detect a luminosity or redshift de- 
pendence of the clustering strength, although our sensitivity to this 
dependence is weak due to the limited dynamic range in both vari- 
ables in our sample. When comparing to previous work, we do not 
plot t he last 4 points quoted in Table 2 of the SDSS Data Release 5 
(DR5| Adelman- McCarthv et al.l2007r) quasar clustering analysis of 
IRoss et al.. c2009r) . Due to the flux-limited quasar selection by the 
original SDSS, the projected Wj, measurement for these high- 2: bins 
are quite noisy I Shen et al.l2007l : lRoss et al.l2009l) , and thus unreli- 
able for any direct comparison to our BOSS measurement. We have 
checked that even if we use the SDSS DR7 data this situation is not 
improved for our 2 < z <2> redshift range of interest. 

Our results strongly favor the consensus that quasars inhabit 
rare and highly biased dark matter halos on the exponential tail 
of the mass function. In the absence of merging we would expect 
the clustering of such halos to evolve slowly with time. For our 
assumed cosmology our quasar samples have biases in the range 
3.4-4, consistent with earl y observations and th e extrapolation of 
previous measurements by ICroom et al.l (|2005|) . Estimates in the 
literature on the typical halo mass for a bright quasar at compara- 
ble redshifts vary wildly, in part due to methodological differences 
and the fitting functions assumed. So, we now turn to how quasars 
occupy dark matter halos (see also Appendix IRJ. 



^dsi{s) 



(9) 



^ Assuming that Zmax is large enough that fingers-of-god are correctly 
included and fe 3> 1 so that the anisotropy due to redshift-space distortions 
is small. 



4 INTERPRETATION AND MODELING 

One of the main goals of studying quasar clustering is to pro- 
vide information on the parent dark matter halos hosting luminous 
quasars. The large-scale bias of the quasars provides information 
on the mean dark matter halo mass; the small-scale clustering pro- 
vides information on satellite fractions and potentially radial pro- 
files within halos. Unfortunately the space density of quasars can- 
not be used directly in such constraints because the quasar duty 
cycle (or activity time) is not known. This is a major difference 
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Sample 


Redshift 


^cB 


Magnitude 


Median 


Mean 


ro 


so 


€ 


Bias 


1 


2.2< z< 2.8 


2.39 


-50.0, 


-10.0] 


-25.8 


-25.9 


8.4 ±0.6 


9.6 ±0.5 


0.52 ±0.06 


3.8 ±0.3 


2 


2.2 < 2 < 2.8 


2.41 


-50.0, 


-25.8] 


-26.5 


-26.6 


9.2 ±0.9 


9.6 ±1.0 


0.54 ±0.12 


3.9 ±0.5 


3 


2.2 < 2 < 2.8 


2.36 


-25.8, 


-10.0] 


-25.2 


-25.1 


8.4 ±1.7 


10.2 ±0.8 


0.65 ±0.12 


4.3 ±0.4 


4 


2.2 < 2 < 2.8 


2.39 


-27.0, 


-25.0] 


-25.9 


-25.9 


7.5 ±0.9 


9.2 ±0.7 


0.48 ±0.06 


3.7 ±0.3 


5 


2.2 < 2 < 2.4 


2.28 


-27.0, 


-25.0] 


-25.8 


-25.9 


7.8 ±1.3 


9.5 ±0.9 


0.51 ±0.10 


3.7 ±0.4 


6 


2.4 < 2 < 2.8 


2.51 


-27.0, 


-25.0] 


-25.9 


-25.9 


6.9 ±1.8 


8.8 ±1.1 


0.43 ±0.13 


3.6 ±0.6 



Table 3. A summary of our clustering results. The samples are as in Table[T] The effective redshift is computed using Eq.[3] The ranges, medians and mean 
magnitudes refer to absolute, j-band magnitude fc-corrected to 2 = 2. The congelation lengths are measured in (comoving) h^^Mpc. The 1 cr eiTors on rg, sq 
and 5 are from bootstrap resampling, as described in the text. The error for sq does not include the additional uncertainty due to the redshift en'or from the 
pipeline. The bias is estimated from f using Eqs.l9land ll2l with the fitting function of iSmith et al, l;2003i) for the real-space correlation function of the mass. 
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atively featureless correlation functions we have access to is diffi- 
cult, but the modeling is made easier by a number of facts. Quasars 
are rare, their activity times are short and the fraction of binary 
quasars is very smalrl JHennawi et alJl2006al : lMvers et alJIlOOTbh . 
This suggests that most quasars live at the center of their dark mat- 
ter halos and the majority of halos host at most one active quasar 
To place constraints on the range of halos in which quasars may 
be active we consider two illustrati ve models described below (see 
Appendix |B] for further details and lOe Graf et al.COl ll for a recent 
discussion in the context of numerical hydrodynamic simulations). 
To make interpretation easier we use a quasar sample which is lim- 
ited at both the bright and faint ends, i.e. —27 < Mi < —25 or 
39.0 < logioiboi < 39.8 (see Tabled sample #4, "Fid"). We 
use the ro measurements listed in Table [3] to constrain the mod- 
els, though similar results are obtained using the Wp data and its 
covariance matrix. We also find that the best-fitting models below 
provide a good fit to the redshift space clustering, assuming our 
fiducial model for redshift errors. 



Figure 11. A comparison of fits to the real-space clustering data, using a 
variety of approximations. The model in each case is the clustering of a 
sample of halos covering one octave in mass, centered on Mf^ . The like- 
lihood of the central mass is computed by fitting to the Wp measured for 
sample 1 using the full covariance matrix determined by bootstrap (solid), 
a diagonal covariance matrix (dotted) or the value of ro determined as de- 
scribed in the text (dashed). For this model the central mass has an error of 
0.5 X 10^^ h-^Mp.. 



with studies of e.g., galaxy clustering, and has serious implications 
for the constraints that can be derived. 

Our median quasar has Mi{z — 2) — —26 and so a 
bolometric lumi nosity of Lboi = 2.5 x 10^^ W JCr 



L et al.l 

I2OO5I : IShen et al.l |2009|) . If this object is radiating at the Ed- 
dington limit (LEdd = lO^'O-^Afbii/lO^Afo] W), then the me- 
dian Mbh in our sample is 2 x 10* Mq. As we shall describe, 
our data are consistent with host halos having a characteristic 
mass of 2 x 10^^ h~^MQ (Fig. lilt , in agre ement with earlier 
work jPor ciani. Magliocchetti & Norberg 2004. ICroom et al .120051 : 
IPorciani & Norbergii2006;iLidz et al..2006i) . This v alue is also con- 
sistent with estim ates from the Mbh — Mh relation ( lFerraresel2002l : 
iFine et al."200^. Mh ~ (1 - 3) x 10^^ /i-^Mq, with the differ- 
ences arising from different assumptions about the halo profiles or 
data sets. This result sugges ts L/Ledd ~ 1, consistent with the 
results of ICroom etall ( l2005h . 

However, we do n't expect quasars to inhabit halos of a sin- 
gle mass (though see lShanks et alj|201ir) . Constraining the possi- 
bly complex manner in which quasars occupy halos from the rel- 



4.1 Lognormal model 

In this model we assume that quasars of some luminosity, L, live 
in halos with a lognoimalj distribution of masses, centered on a 
characteristic mass that scales with L (see also Appendix|B]l. Each 
halo hosts at most one quasar with probability 



P{Mh\L) oc exp 



(InMfe-lnMcon(L))' 
2(j2(L) 



(13) 



where the noimalization is set by the observed space-density of 
quasars but does not matter for the clustering. We expect that Mccn 
will be larger for more luminous quasars that are hosted by more 
massive galaxies. 

For illustration we assume that the lognormal form holds for 
quasars in the luminosity bin —27 < Ali < —25, i.e. that the 
luminosity dependence of Mccn is weak. This is a reasonable ap- 
proximation to many models (see Appendix [Bj and in particular 
the type described in the next section. We generate quasar sam- 
ples from halos in the z = 2.4 output of 4 cosmological simula- 

^ When the virial radius of the hosting halo is much smaller than the mean 
inter-quasar separation, the fraction of quasar-hosting halos which contain 
a second quasar scales as Vhaio«QCv ^ 1. where 1;y is the volume aver- 
aged correlation function within the virial radius. 

^ Quasars are known to have a relatively high bias and hence live in halos 
on the steeply falling tail of the mass function. The differences between 
occupation models which include a high-M cut-off and those that do not is 
therefore relatively small. 
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Figure 12. Estimates of the real- and redshift-space correlation lengths for 
our last three samples (#4-#6, solid circles) and from previous work. Open 
squa res are f rom Porciani, Magliocchetti & Norberg (2004), solid triangles 
from lCroom et al. ( 200 4], conver ted from g measurements), ope n 5 -pointed 
stars f rom Porciani & Norbera fcOOft. o pen o ctagons from M^erse^af 
2006 ), 8-pointed stars from IRoss et al.1 1120091). cro sses from IShen et al 



200S ) and 3-pointed stars from iHickox et al.l J201U obscured and unob- 



scured). In the upper panel the dotted line indicates the evolution of tq 
for a sample of halos of a single mass, M = 2 X 10^^ h~^MQ, while 
the dashed line shows the evolution of a passively evolving sample with 
no mergers l IFrvll 19961) . Both lines are meant for illustrative purposes, and 
there has been no attempt to fit to the data. In the lower panel the solid error 
bars plotted exclude the contribution from our uncertainty in the redshift 
error, which is significant (see text). The dashed en'or bars show the effect 
of doubling the errors for our measurements. 



tions, each employing 1500^ particles (m = 2 x 10^° H'^Mq) 
in a 1 /i~^Gpc box. As in Fig. |9l halos are found in the simula- 
tions by the friends-of-friends algorithm with a hnking length of 
0.168 times the mean inter-particle separation. The halos are se- 
lected with a lognormal probability centered on a series of A/ccn- 
To test for sensitivity to the width of the distribution we consider 
o" = 25, 50 and 100 per cent. The former is mo re appropriate for 
higher redshift or brighter quasar s dWhite. Mart ini & Cohn 2008; 
IShankar. Weinberg & She"n 2010t lOe Graf et 3111201 1) while the 
latter is roughly expected based on the amount of observed scatter 



in the local Mbh — Mgai relation. We average the projected corre- 
lation function for each model — with the same binning and Zmax 
as the data — over the 4 simulations to reduce noise, and compute 
a goodness-of-fit. Using either the full bootstrap covariance matrix 
or just the diagonal entries, we find that the best-fit model is good 
for all choices of a. Not surprisingly, we obtain consistent results 
if we fit ro to the average Wp of the simulations using the proce- 
dure of Sj3]and then compare to the value in Table |3] Our measure- 
ments suggest logio A'/con =i 12.00, 12.05 and 12.15 {H'^Mq) for 
a = 25, 50 and 100 per cent, corresponding to an average halo 
mass (logio M) ~ 11.9 - 12.0 (ft-^Afe). 

The majority of high mass galaxies at high redshift are 
the central galaxy in their dark matter halo, so observational 
stellar mass functions can provide constrai nts on the s tellar- 
to-halo-mass relation at z ~ 2 — 3 (see iMoster et al.l uOld : 
iBehroozi. Conrov & Wechsleil2010l for recent examples). In com- 
bination with the constraints on A/cen from quasar clustering, and 
an assumption about the mean Eddington ratio of our sample, 
we can infer the typical Mbh — Mi, relation for our quasars. 
Taking the l ognor mal model and adopting the conversion of 
iMoster et alj ( 1201 0). the average stellar mass is logj^Q(Af*/AfQ) — 
10-10.2 (with larger values corresponding to larger a). If the 
accretion occurs at ALedd the median black h ole mass is 
login (AAfflg/Afp)) = 8.3 using the conversions of dCroom et al.l 
120051 : IShen et alj|2009l) . We compare these numbers to a variety 
of published Mbh — Mh relations in Fig. [13] Our results are in 
broad agreement with the high redshift inferences but predict larger 
black holes (at fixed halo mass) than wha t would be inferred from 
the local relation of iHaring & Rixl ( 120040 . even if we assume all 
of the stellar mass associated with the halo central galaxy is in 
the bulge and that quasars radiate at Eddington (A — 1). This 
result argues that AIbh should increase, at fixed A/*, by a fac- 
tor of approximately 5 A~^ between z — and z ~ 2.4. This 
change is consistent with the increase measured in lensed quasar 
hosts bv |Pengetai] ( l2006h an d the mod el of Hopkins et al. ( 2007 ai). 
By comparison, the model of ICroton et al.l ( l2006l Fig. 1) predicts 
roughly an order of magnitude increase in Mbh at Af* ~ 10^" 
be tween z = and z — 3. On the other hand, the simulations 
of Isiiacki e t al. (^2007 t Fig. 15) p redict almost no evolution at the 
massive end. Merloni et al.l ( 12010 ) infer evolution of Mbh ~ Af* 
from the zCOSMOS survey with a best-fit po wer-law of (l + z)°-^^ 
-a factor of 2.3 between 2 = and 2.4- while lOecarli et al.l ( 1201(1) 
measure a best-fit power-law of (l + z)'^''^^ -a factor of 1.4 between 
z — and 2.4- from a carefully constructed sample of 96 quasars 
drawn from the literature. Our result favors stronger evolution, but 
given the statistical and systematic uncertainties in all of the mea- 
surements, the uncertainties in stellar mass estimates and selection 
biases towards more massive black holes in flux-limited surveys all 
we can say is that it is encouraging that we see evolution in the 
same sense. 

Inverting this argument, we note that, at these high redshifts 
and masses, obtaining a reasonable Mbh — Mh relation provides 
constraints on the possible occupancy distributions of quasars. In 
particular, because the halo mass function is much steeper than the 
galaxy stellar mass function, the typical stellar mass of a central 
galaxy drops steeply with decreasing halo mass — another way of 
stating that galaxy formation is inefficient in low mass halos. If 
black-hole properties are set by the galactic potential rather than by 
halo properties we expect curvature in the AIbh — Mh relation, 
which impacts how we interpret the duty cycle or the active quasar 
fraction. 
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Figure 13. The halo-black hole mass relation. The point with en'ors shows 
the mean and l-cr spread in log Mj, for our lognormal model (as in ^.U 
with 100 per cent scatter. The vertical error bars convert the luminosity 
range to Mbh assuming L = LEdd- The lines indicate Mbh — ^h 
relations inferred from the hterature and from our scahng model. The long 
dashed Hne gives the relation for the scaling model ( i|4.2t which best fits our 
data. The short dashed Une is the result of Ferrarese (2002) and the dotted 
line with the same slope is the result of ,Fine et al. (2006). The lower solid 
line indicates the local scaling relation of [Haring & Rix (2004) assuming 
Mbuie- e equals the stellar mass infen'ed from the relation of iMoster et al J 
120 lOl) at z = 2.4. The upper solid line assumes Mgjj is 5 times larger at 
fixed Mi, than the local relation. Note the curvature of the line due to the 
inefficiency of galaxy formation at high and low halo mass. 



4.2 Scaling-relation model 

Another possibility is that the instantaneous luminosity of the 
quasar is drawn from a lognormal distribution with central value 
proportional to a power of the halo mass (or circular velocity) or 
the central galaxy mass (or dispersion). Physically such a model 
would arise if quasars radiate with a small range of Eddington ra- 
tios and are powered by black holes whose masses are tightly cor- 
related with the mass or circular velocity of the galaxy or host halo 
(through e.g. black-hole bulge, bulge galaxy and galaxy halo corre- 
lations). The lognormal scatter is a combination of the dispersions 
in each of the relations connecting insta ntaneous luminosity to halo 
mass (see e.g. lCrotoij2009l : ISheij2009l for recent examples of such 
models and Appendix|B]for further references). 

We consider two examples here. First we relate the black-hole 
properties to those of the host halo directly. We choose to use the 
peak circular velocity of the dark matter halo as our measure of 
halo size and take log L to be normally distributed around the (log 
of) 



^pk 



r / "peak \ ' 

"" V200kms-iy 



(14) 



The normalization, Lq, is set by matching the clustering amplitude 
at a given luminosity. In principle we can allow the power-law in- 
dex to varjQ Unfortunately the range of luminosities we can probe 



^ For example, an index of 4 would be appropriate to black holes whose 
growth is stopped by momentum-diiven wi nds and 5 for tho se whose 
growth is stopped by luminosity-driven winds jsilk & Reeslll99a) . 



observationally is relatively small and the differences in index are 
difficult to measure. 

For a given Lo and scatter the halo population defines a lumi- 
nosity function of possible quasars. The comparison of this to the 
observed luminosity function of active quasars allows us to set the 
duty cycle, which will be luminosity (and hence halo mass) depen- 
dent. For each model we generate a mock catalog drawn from the 
halos of the simulations introduced in i]4. 11 We impose the duty 
cycle by randomly subsampling the possible quasars to ensure the 
distribution matches the observed luminosity function and then im- 
pose the magnitude limits to match the observed sample. We com- 
pute Wp and fit for ro as described previously. 

Fig. [14] shows the duty cycle for the best-fit model with 
logjQ Lo = 38.8. The duty cycle peaks near one per cent at 
logio Lboi — 39.5, corresponding to Mi{z — 2) ~ —26.3 or 
black hole masses of (2—3) x 10 Mq . This is near the center of our 
magnitude range and in our model corresponds to halos of several 
times 10^^ H^^Mq or Wpcak ~ 300 kms^^. Converting the duty 
cycle into an activity time is somewhat ill defined. If we assume 
tq ~ fontn, with tn the Hubble time, we find Iq ~ 10^ yr. These 
activity times are broadly consistent with those derived at z ~ 0, 
though since the Hubble time is significantly shorter the duty cy- 
cles are significantly higher. Also, note the luminosity/halo mass 
dependence of the duty cycle, which implies that in this model we 
need extra physics to describe the LF beyond simply major mergers 
with a fixed light-curve. 

A second option is to tie the black-hole properties to the host 
galaxy, relating the galaxy properties to those of the halo by abun- 
dance matching. For the stellar masses and dispersions of inter- 
est here the velocity dispersion of the galaxy is proportional to the 
galaxy circular velocity, and we can take the black-hole mass to 
scale as the 4*^'' pow er of either quantity (as in the local universe; 



iTremaine et al.l2002l) . Then 



Lpk — L, 



( ^* V 

° V200kms-i/ 



ocMr 



(15) 



where the power-law index is approximately unity in both obser- 
vations and numerical simulations (for rep resentative examples see 
iHaring & Rix 2004 : Hopkins et alj|2007al . and references therein). 
Fig. [14] shows that in this model the low luminosity slope of the 
luminosity function is in good agreement with the observations 
or the duty cycle has little luminosity dependence — much of the 
suppression of low luminosity quasars can be accomplished by the 
same physics as is invoked to suppress star formation in lower mass 
galaxies. This cut-off in the occupancy to low halo masses tends to 
flatten the run of bias with luminosity (Fig.llSt in a manner similar 
to luminosity dependent lifetime models, where there are also very 
few quasars in low mass halos. 

At the high luminosity end the suppression could be due to in- 
creasing inefficiency in feed ing a black hole as cold-mode accretion 
becomes less effective (e.g. lSiiacki et al.ll200 7l or the fact that it is 
harder to have a major galaxy merger (which would simultaneously 
drive gas to the center and deepen the potential, allowing luminous 
quasar activity) when the stellar mass is increasing slowly with halo 
mass or due to curvature in the Mbh — Afgai relation (e.g. JGrahaml 
I2OI2I) . To illustrate the general point we have randomly subsampled 
the halos above 10^^ H-^Mq by a fraction 2 x 10^'^ /{M + 10^^) 
to obtain the duty cycles shown in the lower right panel of Fig. 1141 
While the agreement is by no means perfect, the general trends are 
in quite good agreement with observations, i.e. the luminosity de- 
pendence of the duty cycle is relatively weak. 

The determining factors for the flatness of the bias-luminosity 



© 0000 RAS, MNRAS 000, 000-000 



Quasar clustering 13 




0.05 0.1 



0.5 
[10«W] 



0.5 
[10«W] 



Figure 14. The duty cycle or active fraction of quasars. The solid h ne is the lumino sity function of quasars from lCroom et al.l 1 120041) . as modified bv lCrotonI 
|2009.), convertin g from bj to Mi as Mi{z = 2) = Mf, ^ - 0.71 JRichards etai][2006.) and from Mi to Lbol via Mi{z = 2) = 72.5 - 2.51ogio(Lbol) 
iShen et alj2009h . The left hand plot shows, as the histograms in the upper panel, the predicted luminosity functions (divided by 100) from the scaling relation 
model with Lbol ~ ""^ k ^^'^ ^^' ^^ ^'^'^ ^^^ ^^^ ^^^^ scatter, assuming all black holes are active. The lower panel shows the fraction that needs to be 
active at any given time in order to obtain the observed luminosity function (solid line). This is the duty cycle. The right hand plot shows the same parameters 
for a model with Lbol ~ ^'^ ^1 ^"'^ ™ which we have randomly sampled halos above 10^^ H^^Mq as described in the text. The possible quasars in our 
simulations are downsampled by these duty cycles to ensure a perfect match to the luminosity function as described in the text. 




Figure 15. The predicted run of bias with magnitude for quasar samples two 
magnitudes wide in Mi{z = 2) centered on the x-ordinate. There are two 
sets of curves, each with scatter in Lobs of cr = 25 (dotted), 50 (dashed) 
and 100 per cent (long dashed). The steeper curves coiTespond to the model 
with Lbol ~ ''oak ^hile the flatter curves conespond to Lbol ~ M ^j . 
In each case the active fraction has been adjusted to match the observed 
luminosity function, as in Fig. 1141 The dependence on the amount of scatter 
assumed is small because of the two magnitude wide bin that is taken for 
each sample to increase statistics. 



relation (Fig.llSt is the slope of the halo mass observable (i.e. lu- 
minosity) relation and the degree of scatter in that relation. For 
very high clustering amplitudes (as measured at high z) one ob- 
tains an upper limit on the scatter, which can be quite constraining 



for mod els, but at intermediate z quite a large degree of scatter is 
allowed if White. Martini & Cohrj20o3:IShanJkar. Weinberg & Sheij 
201G). The scatter can arise from a number of sources including lu- 
minosity dependent lifetime, but also stochasticity in the relations 
between halo mass and galaxy mass, galaxy mass and central po- 
tential well depth, potential well depth and black hole mass, black 
hole mass and optical luminosity. For reasonable values of halo- 
observable slope, luminosity bin width and stochasticity, one ob- 
tains quite flat b{L) - whether or not there is a sharp cut-off in 
the halo mass distribution. Thus while it is definitely plausible that 
quasar lifetime is luminosity dependent, it is not strictly required 
by the current clustering data at these redshifts. 

The run of bias with halo mass becomes quite shallow at the 
low mass end. If quasar luminosity is additionally affected by the 
inefficiency of galaxy formation in lower mass halos, we expect 
to see a luminosity dependent quasar bias primarily at higher lumi- 
nosities (and redshifts). Unfortunately this is where quasars become 
increasingly rare, which argues that cross-correlation may be the 
best m eans of obtaining a strong clustering signal (e.g. Ishen et al.l 
120091) . We explore this issue briefly in Fig. [16] where we show 
the number of quasar pairs we expect to see in a complete, 10^ 
sq. deg. survey with separation < 20ft^^Mpc in bins of redshift 
and magnitude. In the black areas, with more than 10'^ pairs, a solid 
detection of clustering from the auto-correlation should be possi- 
ble. In the other regions (or for smaller or less complete surveys) 
cross-coiTelation with other quasars or a different tracer is likely 
required. 

By contrast the bias at the low luminosity end is an effec- 
tive discriminator between models where quasar luminosity de- 
pends on halo mass or galaxy mass. Unfortunately BOSS is unable 
to probe this range of quasar luminosities. Future surveys which 
probe fainter magnitudes may be able to settle this question. It may 
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Figure 16. The number of quasar pairs with separation < 20 h^^Mpc in 
the red shift and magnitude bins shown, as predicted by the luminosity func- 
tion of Croorn et al. (2004) as modified by Croton (2009). We assume all 
quasars in the redshift and magnitude bin are observed over 10^ sq. deg. of 
sky. Reducing the sky area reduces the pair counts linearly while finding 
only a fraction of the quasars reduces the counts quadratically. The black 
regions indicate more than 10'^ pairs, the grey regions indicate between 10^ 
and 10^ pairs and the white regions fewer than 10^ pairs. A solid detection 
of clustering from the auto-correlation should be possible in the black re- 
gions while in the white regions it is likely necessary to cross-correlate with 
more numerous samples or to make wider bins in redshift or magnitude. 



also be possible to cross-correlate faint, photometrically selected 
quasars with the brighter spectro scopic quasars from the B OSS 
sample (using e.g., the methods in lMvers. White & Ballll2009n . 



4.3 Redshift evolution 

In Fig. [it] we compared the clustering of our quasar sample to that 
of other well-studied objects at z ~ 2. The quasar clustering am- 
plitude is similar to that of submillimetre galaxies, suggesting they 
live in similar mass halos, and stronger than the typical star form- 
ing population. This is consistent with an evolutionary picture in 
which a merger triggers a massive starburst that creates a submil- 
limetre galaxy an d which is then quenched b y the foimation of a 
bright quasar (e.g. [Alexander & Hickoxll2012h . The descendants of 
our quasar hosts will have comparable clustering amplitudes to the 
quasars themselves, indicating that they will likely evolve into mas- 
sive, luminous early-type galaxies at low redshift. 

To better understand the possible fate of quasar host halos we 
employ another high-resolution N-body simulation which allows 
us to track halos and sub halos down to z = (for details see 
IWhite, Cohn & Smil|[20lQ) . We select all halos at z = 2.4 which 
are central subhalos of halos which lie within an octave (i.e. fac- 
tor of 2) in mass centered on 2 x 10^^ H'^Mq (> lO" particles). 
Essentially all of these halos are the most massive in their local 
environment. One quarter of the hosts fall into a large halo (be- 
coming a satellite) and then lose more than 99.9 per cent of their 
mass (falling below the resolution limit of the simulation) without 
merging with the central galaxy or another satellite subhalo. To the 
extent that this process is well resolved, the stars in any galaxies 
hosted by these subhalos would likely contribute to an intra-halo 
light component, while the black holes would form a freely float- 
ing component or be associated with highly stripped satellites. Of 



Figure 17. Comparison of our measured correlation length (square with 
error bars) to that of other objects at z ~ 2. In each case the hori- 
zontal error bar or width of the shaded region shows the redshift range 
over which the measurement is performed while the vertical error bar or 
height of the shaded region shows the it 1 it regi on. The other da ta sets 
are mid-IR selected star-forming galaxies (MIPS: IgjIU et al.ll2007l) . BM, 
BX and LBG star- forming galaxies (Adelberger et al. 2005), submillime- 
tre galaxies (SMG; l Hickox et alj|2012) and dust-obscured galaxies (DOG; 
iBrodwin et al.ll2008l) . The lines show the clustering of halos with peak cir- 
cular velocity 250 , 3 00 and 350 km s~^ as measure d from the simulations 
described in i|4.1l See lKrumpe. Mivaii & Coij i20ld) . Figs. 9-1 1, for a sim- 
ilar comparison at lower z. 



the remaining 75 per cent of the subhalos, three quarters are the 
most massive progenitor in all their subsequent mergers and remain 
central galaxies to z = Q. The remaining quarter become satellites 
which survive to z = inside more massive halos. As a whole 
the population inhabits 2: ~ halos over a broad range of masses 
(2 X 10^^ - 10^^ H-^Mq) peaking at (1 - 2) x 10^^ /i'^A/q. Ha- 
los of this group scale host galaxies of a few L* today, a population 
dominated by elliptical galaxies. This wide diversity of outcomes is 
reminiscent of the varied fates of 2 ~ 2 star-forming galaxies (e.g. 
IConrov et al.ll2008l) . As constraints on the stellar masses of galax- 
ies within halos become tighter, comparison of the stellar masses 
of quasar hosts with that of their 2 ~ descendants will put con- 
straints on the star-formation history of these objects. 

The evolution of clustering with time can place strong con- 
straints on how ep isodic quasar activity can be. As emphasized by 
ICroom et al.l ( l2005h , if the typical host of quasars does not evolve 
significantly with redshift then quasars cannot be repeated bursts of 
the same black hole because such black holes would live in halos 
which grow in mass as the Universe evolves. However the quanti- 
tative strength of this statement, and the allowed fraction of objects 
which could burst more than once, is difficult to assess. One issue 
is the size of the observational errors, the other is that higher red- 
shift samples typically probe more mas sive black holes than l ower 
redshift samples (as emphasized e.g. by Hopkins et al. 2007b). 

For ex ample, the highes t redshift clustering measurement 
comes from IShen et al.l (|2007|), at 2 ~ 4. Their result is consis- 
tent with host halos of (2 - 10) x 10^^ H'^Mq. Such hosts would 
grow in mass by a factor of approximately 4 between 2: = 4 and 
2.4. Similarly, the progenitors of our 2 x 10^^ /!,~^M0, z = 2.4 
halos are approximately 4 times less massive at z — 4. This sug- 
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gests on average an order of magnitude mismatch in halo masses, 
but with a large error. Assuming no evolution in Eddingto n ratio 
between z = 4 and z = 2A the quasars in the Shen et alj OOOTD 
sample are a factor of about 5 brighter than the BOSS quasars. If the 
power-law index of the Mbh — Mh relation is close to unity there 
is little tension from clusteringj in assuming quasars episodically 
burst. 

The time span between z — 4 and z — 2.4 in our adopted 
cosmology is 1 ft~'^Gyr. The Universe is another IftT^Gyr older 
at z ~ 1.5. Taking ro at z — 1.57 from lRoss et al.l ( 120091) and con- 
verting it to a host halo mass we obtain ~ 5 x 10^^ h~^MQ. The 
descendants of our 2 x 10^^ h~^ Mq, z = 2.4 halos are approxi- 
mately twice as massive at z = 1.5, again leading to little tension 
in a model with episodic outbursts. 

We can see the issues most clearly in Fig. [T2] If halos moved 
with the same large-scale velocities as the dark matter and never 
merged the decrease of their bias wou ld approx imately cancel 
the increased clus tering of the matter l lFrvlll996t see Fig. I of 
Iwhite. et alJlOOTO . For a highly biased sample such passive evolu- 
tion corresponds to almost constant clustering strength, for slightly 
less-biased objects the clustering strength grows slowly with time. 
This result is shown as the dashed line in Fig. [T2l corresponding 
to objects with b{z = 0) = 1.8. Merging of halos and includ- 
ing a finite range of halo masses alters the details of this evolu- 
tion, but keeps the sense unaltered. By contrast, halos of a fixed 
mass predict a clustering strength which drops slowly with time - 
shown as the dotted line in Fig.[T2]for halos of 2 x 10^^ /I'^M©. 
Again, a more realistic scenario with a finite range of halo masses 
has the same sense of the evolution. The current data are not in 
strong conflict with either scenario. If a random fraction of quasars 
repeats at later epochs while new quasars always appear in halos 
of a fixed mass, the measured clustering resembles a sample with 
bias /repeat ^repeat + (1 ^ /repeat )&new. The measured values of ro 
are consistent with being roughly constant over 1 < 2: < 3. For 
this reason it is difficult to put a strong upper limit on the fraction 
of quasars that turn on more than once or the maximum number of 
times a given quasar can burst. 



5 CONCLUSIONS 

We have performed real- and redshift-space clustering measure- 
ments of a uniform subsample of quasars observed by the Baryon 
Oscillation Spectroscopic Survey (BOSS). These quasars lie in the 
redshift range 2.2 < z < 2.8 where there has previously been a gap 
in coverage due to the fact that the quasar and stellar loci cross and 
the difficulty in targeting quasars in large numbers to faint fluxes. 
We detect clustering at high significance in both real- and redshift- 
space for the entire sample and subsamples split by redshift and 
luminosity (see Table[3](. We do not detect a luminosity or redshift 
dependence of the clustering strength, although our sensitivity to 
this dependence is weak due to the limited dynamic range in both 
variables in our sample. 

The two-point correlation functions are consistent with power- 
laws over the range of scales measured, with an underlying real- 
space clustering of the form ^(r) — (ro/r)^. The correlation 
length, ro, does not appear to evolve strongly over the redshift 
range z ~ 3 to 1. This result is consistent with passive evolution 



' The dramatic decrease in quasar numbers to higher redshift does impose 
constraints. 



of a highly biased population, although this interpretation is by no 
means unique. Our results are consistent with quasars living in ha- 
los of typical mass 10^^ h~^MQ at 2: ~ 2.4, in line with expecta- 
tions from earlier surveys. The measured bia s and space density o f 
quasars can be used to infer their duty cycle (ICole & Kaiseilll989r) . 
For our best-fit models the duty cycle peaks at one per cent, imply- 
ing an activity time of ~ 10^ years. This time is comparable to the 
activity times inferred for quasars at lower redshift, although the 
Hubble time at z — 2.4 is shorter than at lower z and hence the 
active fraction is larger. 

The typical host halo mass is similar to the inferred hosts of 
submillimetre galaxies and is more massive than the inferred hosts 
of typical star-forming galaxies at the same redshift. This interpre- 
tation is in turn consistent with an evolutionary picture in which a 
massive starburst creates a submillimetre galaxy and is quenched 
by the formation of a bright quasar. While the typical descendant 
of the halos that host BOSS quasars is likely to host a luminous, 
elliptical galaxy at the present time, we find a wide diversity in de- 
scendants in N-body simulations. 

Using abundance matching to infer the properties of quasar 
host galaxies we find evidence for evolution in the Mbh — Afgai 
relation in the sense that black holes must be ~ 5 x more massive 
at fixed galaxy mass at z = 2.4 than at 2; ~ 0. We find that the 
predictions for how quasar activity and clustering (bias) depend on 
luminosity differ depending on whether we take as our fundamen- 
tal relationship a black hole-halo correlation or a black hole-galaxy 
correlation. This is because the efficiency of galaxy formation is 
strongly (halo) mass dependent for the halos of interest at these 
redshifts, leading to strong curvature in the A/gai — Mh relation. In 
either scenario a modest scatter between halo or galaxy mass and 
observed quasar luminosity (arising, for example, from a combi- 
nation of scatters in the black-hole bulge, bulge galaxy and galaxy 
halo correlations and the Eddington ratio) leads to a shallow depen- 
dence of clustering on quasar luminosity, as observed. 

Future surveys of quasars which probe different regions of the 
luminosity-redshift plane will inform models of quasar formation 
and evolution. In this regard BOSS continues to measure quasar 
redshifts, and we expect the number of quasars in the luminos- 
ity and redshift range discussed here will be more than doubled 
by the end of the survey. It may be possible to incorporate the 
additional, BONUS quasars through cross-correlation or to cross- 
correlate spectroscopic and photometric quasar samples to better 
allow us to break the sample by luminosity, spectral or radio prop- 
erties. In addition BOSS is measuring redshifts for a large sample 
of z > 3 quasars, and analysis of those data will be crucial in un- 
derstanding the early phases of quasar growth. 
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APPENDIX A: REDSHIFT ERRORS 

At the signal-to-noise ratio and redshift at which BOSS is working 
it is difficult to obtain precise redshifts for quasars. Emission lines, 
such as Mgll which are good redshift indicators and which can be 
used at lower redshift, have redshifted into a relatively noisy part 
of the spectrum or off of the device altogether. One hour integra- 
tions on a 2.5 m telescope make it difficult to measure redshifts for 
quasars with weak lines. 

The BOSS pipeline measures quasar redshifts b y fitting their 
spectra to a set of PCA templates JAihara et al.ll201 II) plus a cubic 
polynomial to allow for changes in continuum slope. The reduced 
X^ vs. redshift is mapped in steps of AlogjQ(A) = 10^* from 
z — 0.0033 to 7 and the template fit with the best reduced x^ 
is selected as the redshift. In addition redshifts are computed by 
fitting any lines in two groups (forbidden and allowed ). A com- 
parison of redshifts deter mined from different lines (Hen nawi et al.l 
l2006bl : IShen et alj|2007l) . and visual inspections, suggests the au- 
tomatic redshifts are good to Az/{1 + z) ~ 0.003. At 2: = 2.5 
this corresponds to an error in the line-of-sight distance of roughly 
10 /i~^Mpc (comoving), which is significant compared to the cor- 
relation length of quasar clustering. We are attempting to improve 
our quasar redshift determination, but for now we simply account 
for the residual line-of-sight smearing induced by redshift errors in 
our fitting. 

In the limit 6 3> 1 the redshift-space halo correlation function 
is approximately isotropic and a power-law. If the redshift errors on 
quasars are uncorrelated and Gaussian distributed with fixed ampli- 
tude CTz the observed correlation function is 



U.{s) = U^ 



dZ 



2-Ka 



i(/s 



Z] e 



-(z-Z)^/2cr' 



(AI) 



where a = \/2{caz)/H{z). The integral, ^obs, divided by the in- 
put power-law is what we refer to in the main text as F{s). 

Fig. lAII compares this model, with a power-law correlation 
function of slope 1.8, to halos from N-body simulations with ap- 
plied Gaussian line-of-sight velocity errors. As expected, the agree- 
ment is excellent. In this test the redshift errors were all drawn from 
a Gaussian of the same dispersion. In the observational sample 
we might reasonably expect the errors to depend on the properties 
of each quasar. In this situation we should interpret ct as a pair- 
weighted, "effective", redshift error. The above tests, plus the clus- 
tering measurements themselves, are consistent with a per quasar 
redshift error corresponding to 10 h^^Mpc, and we use that as our 
fiducial value throughout. 



Figure Al. The effects of redshift en'ors on the redshift space (monopole) 
correlation function. The dashed line shows a power-law correlation func- 
tion with slope 1.8, and the other lines are for Eq. l lAll with per-object 
redshift errors corresponding to 2.5, 5 and 10 /i~^Mpc. The points are for 
mock quasars as in i|4.2| with Gaussian errors added to the line-of-sight ve- 
locities. 



APPENDIX B: QUASAR MODEL 

Models of the quasar phenomenon come in several basic flavors, 
but are in fact all quite similar The majority of models assume 
that quasar activity occurs due to the major merger of two gas-rich 
galaxies, since this scenario provides the rapid and violent event 
needed to funnel fuel to the center of the galaxy (e.g. via the bars- 
within-bars instability Shlosman et al. 1989) and feed the central 
engine while at the same time giving a connection between black 
hole fueling and the growth of a spheroidal stellar component. If 
black hole growth is feedback limited, it is only a rapidly grow- 
ing potential well that can host a rapidly accre ting hole. A notable 
exception is the models of Ciotti & OstrikeJ (ll997L l200lh which 
postulate the fuel is funneled to the center by thermal instabilities, 
which provide the rapid growth of the spheroidal component nec- 
essary for black hole growth. As we shall see, both sets of models 
can predict very similar halo occupancy. 

Some models implement the physics directly in numeri- 
cal simulations which attempt to track the hydrodynamics of 
the gas, with subgrid models for quasar and star formation 
and the ass ociated feedback (Sijacki et al. 2007; Hopkins et alj 
l2008l : lDeGrafetal.1 l20Ilh . Other models work at the level of 
dark matter halos, but follow the same physics - in simplified 
form - semi-analytically JCattaneo, Haehnelt & Ree a Il999l: 
[Kauffmann & Haehnelt' '2000'. 2002*: IVolonteri Haar dt & Madm 
12003; Bromlev, Somerville & Fab ianI 120041; IC ranato et alj 
2004; Crotonetal. 2006; Monaco. Fontanot & Taffoni 2007 



Malbon et al. ,2007. ; .Bonoli et ah ,20091 ; iFanidakis et aL ,2012 
Even more idealized are models which are built upon dark 
matter halos but use scaling relations or convenient functional 
form s to relate the quasar properties to those o f their host ha- 
los tefstathiou&Reed Il988l; ICarlberd I1990I; IWvithe & Loebl 



lost 
2OO2I 



20031 ; 'Haiman, Ciotti & Ostrikei"2004'; 'MaruUi et al. |l2006l ; 



Lidzetal.112 006; Croton 2009; Shen 2009; Booth & Schav^ boid) . 



A final level of abstraction is to simply provide a stochastic recipe 
for populating dark matter halos with quasars which is tuned to re- 
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produce the observations as bes t as possible while not attempting to 
follow the underlying physics jPorcia ni. Magliocchetti & Norberg 
20041 : IWhite, Martini &Cohn | | 2008i : Padmanabhan et al. 2009; 



Volonteri & StarM 
Krumpe et al.ii2012i : iKirkpatrick et aLll2012h . 



Shankar, Weinberg & Shen 



"l2010l: 



The modeling is simplified by several facts. Quasars are rare, 
their a ctivity times are short and the fraction of b inary quasars is 
small (iHennawi et alJ 12006 J : JMvers et al.ll2007bh . The hydrody- 
namic and semi-analj'tic models tend to reproduce the observed, 
2 = relation between black hole mass and halo, which is used 
as input to the scaling models. The models agree on the level of 
scatter in the relation (roughly a factor of two) and in broad brush 
on the evolution in the amplitude and slope of this relation with 
redshift. Some models invoke quasar feedback limited accretion 
explicitly while others achieve the same scaling relations without 
such a limit — e.g., by coupling the mechanisms by which bulges 
and black holes grow. At these masses and redshifts the satellite 
galaxy fraction is tiny, so the halo to stellar mass relation is set by 
abundance matching and any model which reproduces the galaxy 
stellar mass function will reproduce this relation. At high redshift 
bright quasars radiate near Eddington, so the models predict similar 
halo occupancies. 

The probability that a halo will undergo a major merger 
in a short redshift interval is only weakly dependent on the 
mass of the halo (Lacey&Cole 1993; Percival et al. 2003; 
Cohn & White 2005: Wet zel, Cohn & White! 20091 : iFakhouri & iv5l 
2009. : .Hopkins et aL.201Q") . i.e. the mass function of such halos is 



almost proportional to the mass function of the parent population. 
Similarly the clustering properties of recently merged halos are 
similar to a random sample of the population with the same mass 
distribution JPercival et all [20031: IWetzel. Cohn & White! |2009|) . 
Thus in any interval, Az, the fraction of halos of mass Mh which 
undergo a quasar event is almost independent of Mh and z and can 
be regarded as a random selection. This makes it difficult to infer 
that quasars arise from mergers simply from their large-scale clus- 
tering, but also implies that for the purposes of modeling the 1- and 
2-point functions of the quasar population it is sufficient to specify 
the halo occupation of the parent population (more complex mod- 
els may be needed if correlations between quasars and properties 
of e.g. galaxies were required). 

To determine whether a quasar candidate makes it into any 
sample it is necessary to relate the observed luminosity to the peak 
luminosity which is determined by Mbh- This is done either by 
specifying a light curve (e.g. a power-law, a power-law with a vary- 
ing slope or an exponential) or directly P{L\Mh) using the fact 
that the triggering rate is understood. For constant triggering rate a 
light curve L ~ t^^ implies P{L) ~ j^-^ii+'^iii ^ go a wide range 
of 7 maps to a narrow range in P{L) index. For the high luminos- 
ity thresholds we only see objects very near their peak brightness. 
Taking into account the factor ~ 2 scatter in Mbh at fixed bulge 
mass, we expect P[L\Mh) to be roughly lognormal with a similar 
width (it can be slightly broader due to variation in the Edding- 
ton ratio at peak). As we probe lower luminosities we are more 
likely to see older, more massive black holes leading to a low L tail 
in P (L\Mh) and more lower-Eddington-ratio objects (Lidz e t al.l 
I2OO6 : Shen 2009 : Cao 2010) . Unless we cut off the probability that 
a halo hosts a quasar at low a nd high halo mass, we will overpro- 
duce low and high L sources iLidz et al.ll2006l : ICrotonll2009l : IShenl 
120090 . In the physical models these limits occur due to lack of fuel 
in lo w mass halos and the i nability of gas to cool in high-mass halos 
(e.g. lCattaneo. Haehnelt & Rees 1999) . As discussed in ii4.2l mod- 



els which match the A/^ ~ hh relation for galaxies tend to roughly 
match the required quasar suppression. 

We are interested in the probability that a given halo hosts a 
quasar in our sample, e.g. fon{Mh) oc P{> LminlMh). From the 
arguments above we expect this relation to be an approximately 
lognormal function which is asymmetric towards high Mh at low 
L. It is quite difficult to put constraints on the detailed form of this 
function using luminosity function and clustering measurements. 

We argued above that the steepness of the halo mass function 
and the high bias of quasars implies that the quasar satellite frac- 
tion is small. The luminosity function thus provides a constraint on 
the duty cycle. (This extra degree of freedom reduces the ability 
of large-scale structure measurements to constrain the halo occu- 
pancy, compared to modeling galaxies.) The steeply falling mass 
function also implies that the number of quasars hosted in very 
massive halos is small regardless of the occupancy statistics of 
such halos. On scales larger than the virial radius of the typical 
quasar host halo (i.e. 200 — 300/i~^kpc) the 2-point function is 
dominated by pairs of quasars in different halos, and thus primarily 
measures the quasar-weighted halo bias which allows us to infer 
the mean, quasar-weighted halo mass. This result remains true for 
cross-correlation studies too, provided we work on scales larger 
than the virial radius of the quasar hosts. On smaller scales the am- 
plitude and slope of the correlation function allow us to measure 
a combination of the satellite fraction of quasars and the low mass 
cutoff. 
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